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1 Introduction 

Let d = (di, • • • , dq)' £ (—1/2, 1/2) 9 and let B be the shift operator. Consider the (/-dimensional 
weakly stationary process {Xt}^ obtained as a stationary solution of the difference equations 

diag {(l-B) d "}(X t -E(X t ))=Y t , (1.1) 
ke{i. •••,<?} 

where {Y t }^Q is a (/-dimensional weakly stationary process whose spectral density function /y 



is bounded and bounded away from zero. Each coordinate process in (1.1) exhibits long-range 
dependence whenever the respective parameter di > 0, in the sense that the spectral density 
function satisfies /(A) ~ KX~ 2di , as A — > + , for some constant K > and !£{!,■■■ , q}. 



Processes of the form (1.1) constitute the so-called fractionally integrated processes. As a 
particular case, consider the situation where the i-th coordinate process {^^}^. f° nows an 
ARMA model. In this case, the associated coordinate process {X^ will be a classical 
ARFIMA process with the same AR and MA orders and differencing parameter di. If the 
process {^f}^ ^ s a vectorial ARMA process, then the resulting multivariate process will be 
the so-called VARFIMA process with differencing parameter d = (di, ■ ■ ■ , d q )' . VARFIMA and, 
more generally, fractionally integrated processes, are widely used to model multivariate processes 
with long-range dependence. See, for instance, the recent work of Chiriac and Voev (2011) on 
modeling and forecasting high frequency data by using VARFIMA and fractionally integrated 
processes. 



The parameter d in (1.1) determines the spectral density function behavior at the zero 
frequency as well as the long run autocovariance/autocorrelation structure. Hence, estimation 
becomes an important matter whenever the long run structure of the process is of interest. 

Estimation of the parameter d in the multivariate case has seen a growing interest in the 
last years. A maximum likelihood approach was first considered in Sowell (1989), but the com- 
putational cost of the author's method is very high. A few years later, Luceho (1996) presented 
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a computationally cheaper alternative for the maximum likelihood approach based on rewriting 
and approximating the quadratic form of the Gaussian likelihood function. In a recent work, 
Tsay (2010) proposed an even faster approach to calculate the exact conditional likelihood based 
on the multivariate Durbin-Levinson algorithm. Although the maximum likelihood approach 
usually provides good results, it is still a computationally expensive method. 

The works of Fox and Taqqu (1986), Giraitis and Surgailis (1990), among others, provided 
a rigorous asymptotic theory for (univariate) Gaussian parametric estimates which includes, for 
instance, n 1 / 2 -consistency and asymptotic normality. One drawback is the crucial role played by 
the Gaussianity assumption in the theory, which also requires strong distributional and regularity 
conditions and is non-robust with respect to the parametric specification of the model, leading 
to inconsistent estimates under misspecification. 

In the univariate case, Gaussian Semiparametric Estimation (GSE) was first introduced 
in Kiinsch (1987) and later rigorously developed by Robinson (1995b). It provides a more 
robust alternative compared to the parametric one, requiring less distributional assumptions 
and being more efficient. In the multivariate case, Robinson (1995a) was the first to study and 
develop a rigorous treatment of a semiparametric estimator. A two-step multivariate GSE has 
been studied in the work of Lobato (1999), which showed its asymptotic normality under mild 
conditions, but without relying on Gaussianity. A few years later, Shimotsu (2007) introduced 
a refinement of Lobato's two-step GSE, which is consistent and asymptotically normal under 
very mild conditions (Gaussianity is, again, nowhere assumed), but with smaller asymptotic 
variance than Lobato's estimator. The technique applied in Shimotsu (2007) was a multivariate 
extension of that in Robinson (1995b), powerful enough to show not only the consistency of the 
proposed estimator, but also the consistency of Lobato's two-step GSE. Recently, Nielsen (2011) 
extended the work of Shimotsu (2007) to include the non-stationary case by using the so-called 
extended periodogram. 

The estimator introduced in Shimotsu (2007) is based on the specification of the spectral 
density function in a neighborhood of the zero frequency. Estimation of the differencing pa- 
rameter d is obtained through minimization of an objective function, which is derived from the 
expression of the Gaussian log-likelihood function near the zero frequency. To obtain the ob- 
jective function, the spectral density is estimated by the periodogram of the process. Although 
asymptotically unbiased, it is well known that the periodogram is not a consistent estimator of 
the spectral density and do not even converge to a random variable at all, cf. Grenander (1951). 
Some authors actually consider the periodogram "an extremely poor (if not useless) estimate of 
the spectral density function" (Priestley, 1981, page 420). 

Our contribution to the theory of GSE is two- folded. First, being consistency a highly 
desirable property of an estimator, we study the consequences of substituting the periodogram in 
Shimotsu (2007) 's objective function by an arbitrary consistent estimator of the spectral density 
function. We prove the consistency of the proposed estimator under the same assumptions as in 
Shimotsu (2007) and no assumption on the spectral density estimator other than consistency. 
Second, considering Shimotsu (2007) 's objective function with the periodogram substituted by 
an arbitrary spectral density estimator, we derive necessary conditions under which GSE is 
consistent and satisfy a multivariate CLT. Gaussianity is nowhere assumed. In order to assess 
the finite sample properties of the estimators studied here and its competitiveness, we present 
a simulation study based on simulated VARFIMA process. We apply the so-called smoothed 
periodogram as an estimator of the spectral density function. 

The paper is organized as follows. In the next section, we present some preliminaries concepts 
and results necessary for this work. In Section 3 we introduce the Smoothed GSE as well as a 
general class of estimators based on appropriate modifications of the GSE's objective function. 
Section 4 is devoted to derive the consistency of the proposed estimator while in Section 5, its 
asymptotic normality is studied. In Section 6 we present some Monte Carlo simulation results to 
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assess the finite sample performance of the proposed estimator. Conclusions and final remarks 
are reserved to Section 7. Proofs are presented in Appendix A. 

2 Preliminaries 



Let {-X"f}^ be a (/-dimensional process specified by (1.1) and assume that the spectral density 
matrix of Yt satisfies fy ~ G for a real, symmetric, finite and positive definite matrix G. Let / 
be the spectral density matrix function of Xt, so that 

E[(X t -E(X t ))(X t+h --E(X t ))']= f e ihX f(X)d\, 

J — 7T 

for h € IN* := 1N\{0}. Following the reasoning in Shimotsu (2007), the spectral density matrix 
of X t at the Fourier frequencies Xj = 2-irj/n, with j = 1, • • • ,m and m = o(n), can be written 

as 

f(Xj) ~ A.j(d)GAj(d) ', for A 3 {d)= diag {A<f\d)} and \f \d) = A7 d V (7r - A > )<W2 , (2.1) 

fee{i, •••,<?} 

where, for a complex matrix A, A denotes the conjugate transpose of A. Let 
w n (X) := — == V X t e itA and J„(A) := w n (X)w n (X) ' 

be the discrete Fourier transform and the periodogram of Xt at A, respectively. In order to define 
the smoothed periodogram of Xt, let W n (k) := (W? (fc))^ =1 be an array of functions (called 
weights) and {^(A;)}^ =0 be an increasing sequence of positive integers satisfying the following 
conditions 

Al. \/l[n) + £(n)/n — > 0, as n tends to infinity; 
A2. WH{k) = W'n j {-k) and WH (k) > 0, for all k; 
A3. E\ k \<e(n)Wi?(k) = l; 

A4. Yl\k\<e(n) Wn {k) 2 — > 0, as n tends to infinity. 

Conditions Al - A4 are standard in the literature of smoothed periodogram. See for instance 
Priestley (1981). For a Fourier frequency Xj, let 

+k)w n (X : j + k) , (2.2) 

\k\<i(n) 

be an estimator of the spectral density matrix of Xt, where denotes the Hadamard product. 
If, for some j and k, Xj + k £ [— it, tt], we take w n as having period 2ir. The function f n can be 
extended to take values on R. In order to do so, we need to introduce the following auxiliary 
function. For a fixed n > 0, given A G [0,tt], there exists ko £ IN"* such that Afc _i < A < Afc . 
Define 

g(\- n) := { ^o -1 ' if ^ _ ^feo-i < ^fco _ ^1 
\ ^iji if A — Afc _i > Afe — A. 

If A £ [— 7r,0), we define g(X;n) := g(—X;n). Then, if A is not a Fourier frequency, we extend 



( |2.2D by setting 

/«(A) :=/„(fl(A;»)). 



Expression (2.2) extended in this way is a multivariate extension of the univariate smoothed 



periodogram function. In (2.2), we allow the use of different weight functions for different com- 



ponents (including cross spectrum components) of the spectral density matrix. In this manner 
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we accommodate the necessity, often observed in practice, to model different characteristics 
present in the spectral density matrix by using different weight functions. We refrain from 
presenting further discussion on the different types of weight functions in the literature, since 
they can be found in most text books on the subject. See, for example, Priestley (1981) and 
references therein. 

In practice, at zero frequency, we use a slightly different estimative, namely, 



/ n (0) := Rc 



M / n(O)0w„(Ai)w„(Ai) 



2^W n {k) Q w n {\ k+1 
fc=i 



li0 n (Ajfe+i) 



Considering (1.1), under conditions Al to A4, the smoothed periodogram is n 1//2 -consistent 



for the spectral density matrix whenever d G [— 1/2, 0] g . See, for instance, Grenander and 
Rosemblatt (1953) and Priestley (1981). 



3 Smoothed Gaussian Semiparametric Estimation 



From the local form of the spectral density at zero frequency (2.1 ) replaced in the frequency do- 
main Gaussian log-likelihood localized at the origin, Shimotsu (2007) proposed a semiparametric 
estimator for the fractional differencing parameter d based on the objective function 

q 1 _ m . 

R(d) :=log(det{G(d)}) -2^4-X) lo g(Ai), (3.1) 

k=x m j=l 

where 

1 m 

^ :=-EMAi(<r%(A;)AjPF']- (3-2) 

m 3=1 



with Aj(d) defined in (2.1). The estimator of d is then given by 



d — arg min{ i?(d)}, (3.3) 
dee 

where the space of admissible estimates is of the form = [— 1/2 + e±, 1/2 — 62], for arbitrarily 



small ej > 0, i = 1,2, henceforth fixed. In Section 6 we shall denote the estimator (3.3) 



by Sh. Shimotsu (2007) shows that the estimator based on the objective function (3.1) and 



(3.2) is consistent under mild conditions. The proof, however, is complicated and involves 
somewhat delicate results on the periodogram function I n . A natural question is what happens 
if the periodogram /„ is replaced by a consistent estimator of the spectral density function? 
As we shall see later, replacing the periodogram I n by a consistent estimator of the spectral 



density is sufficient to guarantee the consistency of the estimator (3.3) in the case of long-range 
dependence, d G (0, 1/2) 9 . For the case d G (— 1/2, O) 9 , surprisingly, consistency alone is not 
enough, but rjr -consistency, for some /3 G (0, 1), guarantees the consistency of the estimator for 
d G (— P/2, 1/2) 9 . Under extra regularity conditions on f n , the n^-consistency can be relaxed 



(see Theorem 4.2). Proofs are significantly simpler than the original ones. 



Let f n be an arbitrary estimator for the spectral density function /. We are interested in 
estimators based on objective functions of the form 



1 -, m 

S{d) := log (det{G(d)}) -2^-^10^-), (3.4) 

with 



m 

k=i j=i 



d ^ ■=-J2M A 3(d)- 1 fn(X j )'Md) =I ']- (3-5) 



m 
3=1 
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Notice that (|3.4|) is just (3.1) with the periodogram I n in (3.2) replaced by f n . The estimator 



of d is then defined analogously as 



d = argmin{S(d)}. (3.6) 
dee 



The estimator (3.6) based on the smoothed periodogram shall be denoted in Section 6 by SSh. 



Before proceeding with the re sults , we shall establish some notation. Let {Xt}f^ be a q- 



dimensional process specified by (1.1) and let {e^te'Z such that X t — E(X t ) = Ylk=o ^fc £ t-fc- 
We define a function A by setting 

oo 

A(X) :=]Tv4 fe e ifcA . (3.7) 
The periodogram function associated to {ejjtez is denoted by I s , that is, 

i n 

4(A) := w £ (A)w £ (A) where w e (X) := -^=^e t e itA . (3.8) 



'2im t=1 



For a matrix M, we shall denote the r-th row and the s-th column of M by (M) r . and (M). s , 
respectively. 

4 Consistency of the estimator 



Let {-Xtj^Q be a (/-dimensional process specified by (1.1) and / be its spectral density matrix. 
Suppose that the spectral density matrix of the weakly stationary process {Yt}t^Q in (1.1) 
satisfy ./y(A) ~ Go for a real, symmetric and positive definite matrix Go = (Gq s )^, s=1 . Let 
do = (c?5, • • • ,dg)' be the true fractional differencing vector parameter and assume that the 
following assumptions are satisfied: 

Bl. As A ^ 0+, 

/ rs (A) = C "(M)/2 G ^ A -M + o(A -«), for all r, a = 1, • • • , q. 
B2. Denoting the sup-norm by || • ||oo, assume that 

oo oo 

X t --E(X t ) = ^A k £ t _ k , ]T||A fc ||^ <oo, (4.1) 

k=0 fe=0 

where {£f}tez is a process such that 

E(e t |^ t _ 1 )=0 and E(e t £' t \^t-i) = I 9 , a.s. 

for all fGZ, where l g is the q x q identity matrix and &t denotes the o- field generated by 
{e s , s < t}. Also assume that there exist a scalar random variable £ and a constant K > 
such that E(£ 2 ) < oo and P(||£t||^ > rj) < KP(£ 2 > rj), for all rj > 0. 



B3. In a neighborhood (0,5) of the origin, A given by (3.7) is differentiable and, as A — > + , 

i) 



ox 



(^(A)').,=0(A- 1 ||(A(A)')J|J. 



B4. As n — > oo, 

1 TO 

- + ► 0. 

to n 
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Remark 4.1. Assumptions B1-B4 are the same as in Shimotsu (2007) and are multivariate 
extensions of the assumptions made in Robinson (1995b) and analogous to the ones used in 
Robinson (1995a) and Lobato (1999). Assumption Bl describes the true spectral density ma- 
trix behavior at the origin. Notice that, since lim / s i _ 5 . + e lA — 1 = 0, replacing e m ( d r~ d °)/ 2 by 
e i(7r-A)(<4!-d°)/2 ma k es no difference. Assumption B2 regards the causal representation of Xt, 
and more specifically, the behavior of the innovation process which is assumed to be a not neces- 
sarily uncorrelated square integrable martingale difference uniformly dominated (in probability) 
by a scalar random variable with finite second moment. Assumption B3 is a regularity condition 
(also imposed in Fox and Taqqu, 1986 and Giraitis and Surgailis, 1990, among others, in the 



parametric case) and will be useful in proving Lemmas 4.1 and 4.2 below. Assumption B4 is 



minimal but necessary since m must go to infinity for consistency, but slower than n in view 
of Assumption Bl, which specifies / only at a neighborhood of the zero frequency (recall that 
Aj = 2vrj/n, j = 1, ■ • • ,m). 

Observe that assumptions B1-B4 are only concerned to the behavior of the spectral density 
matrix on a neighborhood of the origin and, apart from integrability (implied by the process' 
weakly stationarity property) , no assumption whatsoever is made on the spectral density matrix 
behavior outside this neighborhood. For (3 G [0, 1] and q G IN*, let 



fia := 



2' 2 



n 



1 1 



(2 



(4.2) 



for a > 0,i G {1,2}. 

establishes the consistency of G(do) 



Lemma 



4.1 



given in (3.5) under the assumption of n 



consistency of f n and will be useful in proving Theorem 4.1 



P. 



The proofs of all results in the 



paper, due to their lengths, are postponed to the Appendix A. 



Lemma 4.1. Let {JQ}^ be a q- dimensional process specified by (1.1) and f be its spectral 
density matrix. Let f n be a n 13 -consistent estimator for f. If do G Qp, then 



C7(d ) = G + op(l). 



Theorem 4.1 establishes the consistency of d, given in (3.6), under assumptions B1-B4 and 
assuming n^-consistency of the spectral density function estimator. 



Theorem 4.1. Let {Xt}^ Q be a q-dimensional process specified by (1.1) and f be its spectral 

density matrix. Let f n be a n@ -consistent estimator of f, for (3 G [0, 1], and let d be as in (3.6). 

~ p 

Assume that assumptions B1-B4 hold and let do G Q,p. Then, d > do, as n —> oo. 



Lemma 4.2 will be useful in proving Theorem 4.2 



Lemma 4.2. Let {Xt}^ be a q-dimensional process specified by (1.1) and f be its spectral 
dens ity matrix. Let f n be an estimator of f , and consider the estimator d based on f as in 
(3.6). Assume that assumptions B1-B4 hold. Suppose that f n satisfies 



E 



(n-<-^ f rs iXj) _ (A(\j)) r I e (\j) (A(Xj) X|) = 0(r«-<S-9, 



as n — > oo, 



(4.3) 



for some 7 > 0, for all r,s G {1, • • • , q} and do G @, where A and I e are given by (3.7) and 
(3.8), respectively. Then, for 1 < u < v < m, 

max ( Ve^i-X<M)/2Af ^/-(A,-) - G r A = stf uv + 3§ uv , 
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where £/ uv and 8S UV satisfy 

E(K„|) = Oiv 1 ^) 



max V\v @l uv } = op(l). 

1<-U<v<m 



Theorem 4.2 we derive a necessary condition for the consistency of d given in (3.6), when 



the consistency condition on f n is relaxed. 



Theorem 4.2. Let {X t }^ be a q- dimensional process specified by (1.1) and f be its spectral 



density matrix. Let f n be an estimator of f, and consider the estimator d, based on f n , given in 



(3.6). Assume that assumptions B1-B4 hold. Suppose that f n satisfies (4.3), for some 7 > 0, 



for all r,s G {1, • 

~ P 
Then, d > do, 



,q} and do G Q, where A and I e are given by (3.7) and (3.8), respectively. 



as n 



00. 



Corollary 4.1 establishes the consistency of the estimator (3.6) based on the smoothed peri- 



odogram under assumptions A1-A4. We shall also need an extra minimal condition involving 



the rate of convergence between m in (3.4) and £(n). 



A5. As n — > 00, £(n)/m -4 0. 



Corollary 4.1. Let {Xt}^ be a q-dimensional process specified by ( |1.1[ ) and f be its spectral 
density matrix. Assume that assumptions B1-B4 hold. Let {Wn (k)}f j =1 be a sequence of 
weights satisfying assumptions A1-A5 and let f n be the respective smoothed periodogram given 



in (2.2). For do G B ; consider the estimator (3.6). Then, d > do, as n 



00. 



We observe that assumption A5 is not necessary when d$ € (0, 1/2) q , as it is clear from the 
proof of Corollary |4.1| 



Remark 4.2. Recall that the smoothed periodogram is n 1//2 -consistent (under A1-A4) for do £ 
[— 1/2, 0] g , so that Theorem 4.1 applies and we conclude that the estimator 
-1/2,0] C 



for do G ^1/2 H ( — 1/2,0] C [—1/4,0]. In contrast, as shown in Corollary 4.1 Theorem 4.2 also 
holds and establishes the consistency of (3.6) for all admissible parameter in 0, under A1-A5. 



|3.6[ ) is consistent 
Theorem 



The latter is a far more interesting result, but to obtain it, we had to show that condition 



(4.3) holds (see the proof of Corollary 4.1), which required some cumbersome computations. 



The former result, in contrast, was automatically obtained from the properties of the spectral 
density estimator. Furthermore, Theorem 4.2 does not assume consistency of the underline 
spectral density estimator. In fact, the periodogram itself satisfies condition ( |4.3| ) (see lemma 
1(a) in Shimotsu, 2007). 



5 Asymptotic normality of the estimator 



In this section we present a sufficient condition for the asymptotic normality of the GSE given 



by (3.6), under similar assumptions to Shimotsu (2007), with f n an estimator of the spectral 
density function satisfying a single regularity condition. The asymptotic distribution of the 



estimator (3.6) will be the same as (3.3), established by Shimotsu (2007). 



Again, let { X t }^ be a q-dimensional process specified by ( 1.1 ) and / be its spectral density 



m 



matrix. Suppose that the spectral density matrix of the weakly stationary process {Yt} 
(1.1) satisfies _/V(A) ~ Go for a real, symmetric and positive definite matrix Go = (Gq 
Let do = (^?> " " " id®)' be the true fractional differencing vector parameter. Assume that the 
following assumptions are satisfied 



00 
t=o 
<? 

r,s=l ' 
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CI. For a 6 (0,2] and r,s E {1, • • ■ ,<?}, 

/„(A) = e^^-^^-O^A-^-^GS 5 + 0(\- d "- d "+ a ), as A — > 0^ 

C2. Assumption B2 holds and the process {et}tez has finite fourth moment. 
C3. Assumption B3 holds. 
C4. For any 5 > 0, 

1 m 1+2a log(m) 2 log(n) 

1 ; > 0, as n — > oo. 

C5. There exists a finite real matrix M such that 

A J (d ) _1 v4(A J ) = M + o(l), as -> 0. 



Remark 5.1. Assumption CI is a smoothness condition often imposed in spectral analysis. 
Compared to assumption 1 in Robinson (1995a), assumption CI is slightly more restrictive. It 
is satisfied by certain VARFIMA processes. Assumption C2 imposes that the process {Xt}teiN* 
is linear with finite fourth moment. This restriction in the innovation process is necessary in 
the proof of Theorem |5.1| in applying a CLT to a certain martingale difference derived from 
a quadratic form involving {et}tez, which must have finite variance. Assumption C4 is the 
same as assumption 4' in Shimotsu (2007) and is slightly stronger than the ones imposed in 
Robinson (1995b) and Lobato (1999) (see Shimotsu, 2007 p. 283 for a discussion). It implies 
that (m/n) b = o(m~2^ log(?n) _ a), for 67^0. Assumption C5 is the same as assumption 5' in 
Shimotsu (2007) and is a mild regularity condition in the degree of approximation of A{\j) by 
Aj-(do). It is satisfied by general VARFIMA processes. 



The next lemma will be useful in proving Theorem 5.1 The proofs of the results in this 
section are presented in Appendix A. 



Lemma 5.1. Let {Xt}^ be a q- dimensional process specified by (1.1) and f be its spectral 



den sity matrix. Let f n be an estimator of f, and consider the estimator d, based on f n , given 
e that assumptions C1-C5 hold. Suppose that f n satisfies 

v / d"+d° \ 

E - (A(A i )) r .J.(A i )(5(Ad , )..| = ^( log( ^o + Ig- 1/2 j. 



3=1 



for all r, s G {1, • • ■ , q}, 1 < v < m and do G 0, where L e and A are defined in (3.8) and (3.7), 
respectively. Then, 



(a) uniformly in 1 < v < m, 



(b) uniformly in 1 < v < m, 

max lj^^M-*W\f^&\\j)-G?) =0 F f^+«V 2 log( w) N ). (5.3) 
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The next theorem presents a necessary condition for the asymptotic normality of the GSE 
given in (3.6). We notice that the variance-covariance matrix of the limiting distribution is the 
same as the estimator in (3.3), as derived in Shimotsu (2007). 



Theorem 5.1. Let {Xt}^ Q be a q-dimensional process specified by (1.1) and f be its spectral 
den sity matrix. Let f n be an estimator of f , and consider the estimator d, base d on f n , given 
in (3.6). Assume that assumptions C1-C5 hold. Suppose that f n satisfies (5.1), for all r,s 6 

{!,-•• , q} and do 6 8. If d — > do, for do € 0, then 



,1/2 



(d-do) — A^fT 1 ), 



as n tends to infinity, where 

Q := 

with I q the q x q identity matrix. 



Go © Gq" 1 + l q + ^— (G Gq 1 - I q ) 



6 Monte Carlo Simulation Study 

In this section we present a Monte Carlo simulation study to assess the finite sample performance 



of the proposed estimator given in (3.6) (denoted SSh). For comparison purposes, we also 
calculate the estimator ( |3.3[ ) (denoted Sh). All Monte Carlo simulations are based on time 
series of fixed sample size n = 1,000 obtained from bidimensional Gaussian VARFIMA(0, d, 0) 
processes for several different parameters d and correlations p E {0,0.3,0.6,0.8}. We perform 
1,000 replications of each experiment. To generate the time series, we apply the traditional 
method of truncating the multidimensional infinite moving average representation of the process. 
The truncation point is fixed in 50,000 for all cases. 



In all simulations, we apply the smoothed periodogram (2.2) with the same weights for all 



spectral density components, given by the so-called Bartlett's window, that is, we use 

:= Sln2(n 2 fc/2) ; > for all i, j = 1, 2. 
n V ' 27msin 2 (fc/2) 

The truncation point of the smoothed periodogram function is of the form £(n, B) := \n^\ , for 
/3 G {0.7,0.9}, while the truncation point of the estimators SSh and Sh are both of the form 
m := m{n) = \n a \, for a G {0.65,0.85}. 

The routines are implemented in FORTRAN 95 language optimized by using OpenMP di- 
rectives for parallel computing. All simulations were performed by using the computational 
resources from the (Brazilian) National Center of Super Computing (CESUP-UFRGS). 

reports the simulation results for d £ {(0.1, 0.4), (0.4, 0.4), (0.2, 0.2), (0.2, 0.3)}. 



Table 



6.1 



Presented are the estimated values (mean), their standard deviations (st.d.) and the mean 
square error of the estimates (mse). 

We observe that in all cases both estimators perform well, but the SSh usually present 
a slightly better performance in terms of mse compared to the Sh estimator. In terms of 
mse, the best combination of (a, f3) for the SSh estimator is (0.85, 0.9) in all cases. For the 
Sh estimator, we found that a = 0.85 produces better results than a = 0.65, applied in the 
simulations presented in Shimotsu (2007). As the correlation in the innovation process increases, 
the estimated values slightly degrade. 



In Table 6.1 we notice that the trade off between variance and bias, an usual feature in 
smoothing the periodogram, has little effect in the estimated values. In most cases, we observe 
that the higher the value of 8, the higher the variance but smaller the bias of the estimate. 
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Table 6.1: Simulation results of the estimator (3.6 1 based on the smoothed periodogram with the Barttlet's 
spectral window (weights) and estimator (3.3 1 in VARFIMA(0, d, 0) processes. Presented are the estimated 
values (mean), its standard deviation (st.d) and the mean square error of the estimates (mse). 



p 


Method 


P 


di 


d = (0.1,0.4) 


d = (0.4,0.4) 


a = 0.65 


a = 0.85 


a = 0.65 


a = 0.85 


mean st.d. mse 


mean st.d. mse 


mean st.d. mse 


mean st.d. mse 





SSh 


n 7 

U. / 


di 
d 2 


0.1030 0.0544 0.0030 
0.4467 0.0789 0.0084 


0.0955 0.0268 0.0007 
0.4140 0.0402 0.0018 


0.4506 0.0709 0.0076 
0.4453 0.0781 0.0082 


0.4119 0.0366 0.0015 
0.4138 0.0401 0.0018 


0.9 


di 
d 2 


0.1047 0.0447 0.0032 
0.4230 0.0539 0.0044 


0.0953 0.0268 0.0007 
0.3949 0.0306 0.0010 


0.4312 0.0601 0.0046 
0.4228 0.0624 0.0044 


0.3944 0.0291 0.0009 
0.3948 0.0306 0.0010 


Sh 




di 
d 2 


0.1053 0.0581 0.0020 
0.3955 0.0606 0.0022 


0.0955 0.0271 0.0005 
0.3822 0.0292 0.0006 


0.4136 0.0590 0.0020 
0.3955 0.0606 0.0021 


0.3851 0.0277 0.0008 
0.3822 0.0292 0.0007 


0.3 


SSh 


n 7 

U. / 


di 
d 2 


0.1100 0.0513 0.0027 
0.4468 0.0741 0.0077 


0.0982 0.0257 0.0007 
0.4148 0.0384 0.0017 


0.4497 0.0648 0.0067 
0.4478 0.0714 0.0074 


0.4118 0.0346 0.0013 
0.4142 0.0375 0.0016 


0.9 


di 
d 2 


0.1076 0.0525 0.0028 
0.4243 0.0587 0.0040 


0.0960 0.0256 0.0007 
0.3958 0.0288 0.0008 


0.4299 0.0555 0.0040 
0.4253 0.0581 0.0040 


0.3942 0.0277 0.0008 
0.3952 0.0286 0.0008 


Sh 




di 
d 2 


0.1038 0.0542 0.0030 
0.3985 0.0572 0.0033 


0.0947 0.0259 0.0007 
0.3834 0.0273 0.0010 


0.4109 0.0550 0.0031 
0.3991 0.0570 0.0032 


0.3846 0.0265 0.0009 
0.3828 0.0273 0.0010 


0.6 


SSh 


n 7 

U. / 


di 
d 2 


0.1298 0.0483 0.0032 
0.4500 0.0665 0.0069 


0.1073 0.0248 0.0007 
0.4182 0.0361 0.0016 


0.4499 0.0566 0.0057 
0.4503 0.0607 0.0062 


0.4125 0.0310 0.0011 
0.4143 0.0327 0.0013 


0.9 


di 

d 2 


0.1167 0.0467 0.0025 
0.4274 0.0525 0.0035 


0.0990 0.0234 0.0006 
0.3981 0.0262 0.0007 


0.4290 0.0485 0.0032 
0.4280 0.0505 0.0033 


0.3943 0.0250 0.0007 
0.3953 0.0252 0.0007 


Sh 


- 


di 
d 2 


0.1034 0.0478 0.0023 
0.4029 0.0501 0.0025 


0.0939 0.0939 0.0006 
0.3858 0.0241 0.0008 


0.4077 0.04077 0.0024 
0.4037 0.0498 0.0025 


0.3840 0.0239 0.0008 
0.3836 0.0240 0.0008 


0.8 


SSh 


0.7 


di 
d 2 


0.1570 0.0512 0.0059 
0.4669 0.0647 0.0087 


0.1233 0.0278 0.0013 
0.4295 0.0374 0.0023 


0.4506 0.0535 0.0054 
0.4512 0.0557 0.0057 


0.4130 0.0290 0.0010 
0.4142 0.0298 0.0011 


0.9 


di 
d 2 


0.1307 0.0458 0.0030 
0.4372 0.0509 0.0040 


0.1055 0.0231 0.0006 
0.4037 0.0258 0.0007 


0.4288 0.0451 0.0029 
0.4290 0.0464 0.0030 


0.3946 0.0231 0.0006 
0.3953 0.0230 0.0006 


Sh 


- 


di 
d 2 


0.1060 0.0448 0.0020 
0.4071 0.0465 0.0022 


0.0946 0.0218 0.0005 
0.3882 0.0221 0.0006 


0.4066 0.0448 0.0020 
0.4056 0.0458 0.0021 


0.3838 0.0221 0.0008 
0.3840 0.0218 0.0007 


P 


Method 


P 


di 


d = (0.2,0.2) 


d = (0.2,0.3) 


a = 0.65 


a = 0.85 


a = 0.65 


a = 0.85 


mean st.d. mse 


mean st.d. mse 


mean st.d. mse 


mean st.d. mse 





SSh 


n 7 

U. 1 


di 
d 2 


0.2085 0.0570 0.0033 
0.1994 0.0572 0.0033 


0.1946 0.0277 0.0008 
0.1950 0.0294 0.0009 


0.2085 0.0569 0.0033 
0.3164 0.0643 0.0044 


0.1946 0.0277 0.0008 
0.3000 0.0321 0.0010 


0.9 


di 
d 2 


0.2087 0.0565 0.0033 
0.1989 0.0567 0.0032 


0.1924 0.0269 0.0008 
0.1926 0.0286 0.0009 


0.2087 0.0565 0.0033 
0.3078 0.0580 0.0034 


0.1924 0.0269 0.0008 
0.2919 0.0289 0.0009 


Sh 




di 
d 2 


0.2072 0.0581 0.0034 
0.1925 0.0601 0.0037 


0.1915 0.0271 0.0008 
0.1902 0.0291 0.0009 


0.2072 0.0581 0.0034 
0.2937 0.0603 0.0037 


0.1915 0.0271 0.0008 
0.2860 0.0291 0.0010 


0.3 


SSh 


0.7 


di 
d 2 


0.2072 0.0530 0.0029 
0.2014 0.0539 0.0029 


0.1945 0.0264 0.0007 
0.1952 0.0275 0.0008 


0.2097 0.0530 0.0029 
0.3171 0.0601 0.0039 


0.1951 0.0264 0.0007 
0.3000 0.0301 0.0009 


0.9 


di 
d 2 


0.2073 0.0527 0.0028 
0.2009 0.0538 0.0029 


0.1923 0.0257 0.0007 
0.1927 0.0268 0.0008 


0.2085 0.0527 0.0028 
0.3093 0.0547 0.0031 


0.1924 0.0257 0.0007 
0.2922 0.0270 0.0008 


Sh 




di 
d 2 


0.2050 0.0543 0.0030 
0.1955 0.0567 0.0032 


0.1912 0.0260 0.0008 
0.1907 0.0272 0.0008 


0.2051 0.0542 0.0030 
0.2967 0.0568 0.0032 


0.1910 0.0259 0.0008 
0.2867 0.0272 0.0009 


0.6 


SSh 


n i 
U. / 


di 
d 2 


0.2058 0.0464 0.0022 
0.2039 0.0473 0.0022 


0.1946 0.0239 0.0006 
0.1952 0.0242 0.0006 


0.2138 0.0470 0.0024 
0.3178 0.0525 0.0031 


0.1970 0.0240 0.0006 
0.3001 0.0267 0.0007 


0.9 


di 
d 2 


0.2057 0.0462 0.0022 
0.2035 0.0473 0.0023 


0.1923 0.0233 0.0006 
0.1928 0.0236 0.0006 


0.2095 0.0461 0.0022 
0.3111 0.0481 0.0024 


0.1929 0.0233 0.0006 
0.2926 0.0239 0.0006 


Sh 




di 
d 2 


0.2025 0.0478 0.0023 
0.1993 0.0496 0.0025 


0.1909 0.0235 0.0006 
0.1911 0.0239 0.0007 


0.2030 0.0477 0.0023 
0.3005 0.0496 0.0025 


0.1904 0.0235 0.0006 
0.2877 0.0239 0.0007 


0.8 


SSh 


0.7 


di 
d 2 


0.2054 0.0430 0.0019 
0.2049 0.0436 0.0019 


0.1948 0.0222 0.0005 
0.1952 0.0221 0.0005 


0.2192 0.0448 0.0024 
0.3208 0.0492 0.0028 


0.1998 0.0228 0.0005 
0.3014 0.0249 0.0006 


0.9 


di 
d 2 


0.2051 0.0428 0.0019 
0.2046 0.0435 0.0019 


0.1925 0.0216 0.0005 
0.1928 0.0215 0.0005 


0.2117 0.0428 0.0020 
0.3131 0.0445 0.0021 


0.1939 0.0216 0.0005 
0.2933 0.0219 0.0005 


Sh 




di 
d 2 


0.2016 0.0445 0.0020 
0.2008 0.0455 0.0021 


0.1909 0.0218 0.0006 
0.1912 0.0217 0.0005 


0.2026 0.0444 0.0020 
0.3024 0.0456 0.0021 


0.1903 0.0218 0.0006 
0.2883 0.0217 0.0006 
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(a) 



(b) 



(c) 



(d) 





(e) 



(f) 



(g) 



00 



Figure 6.1: Histogram and kernel density of the SSh estimated values of do = (0.1,0.4) for (a) and (b), p — 0, 
(c) and (d), p = 0.3, (e) and (f), p = 0.6 and (g) and (h), p = 0.8. 



The variance of the estimator responds stronger to changes in a than in (5 in the opposite 
direction, that is, the higher the a, the smaller the variance. The estimated value of d by the 
SSh seems to respond stronger to changes in a than in j3. 



0.9. Figures 6.1 ^a) and |6.1[b) correspond to p = 0, 



0.6 and Figures 6.1 'g) and 



Figure |6.1| presents the histogram and kernel density estimator of the SSh estimated values 
for d = (0.1,0.4) when a = 0.85, p 

Figures [6jjc) and |6.1[ d) to p = 0.3, Figures [6~T| e) and |6.1[ f) to p 
|6.1fh) to p = 0.8. Notice that the resemblance of the histograms to the normal distribution is 

(2) 

greater for Oq = 0.4 than to 0.1. At this moment, we were not able to prove the asymptotic 
normality of the SSh estimator with the smoothed periodogram by direct verification of (5.1). 



However we conjecture that this is the case and Figure 6.1 supports this opinion. 



7 Conclusions 



In this work we propose and analyze a class of Gaussian semiparametric estimators of multivari- 
ate long-range dependent processes. The work is motivated by the semiparametric methodology 
presented in Shimotsu (2007). More specifically, we propose a class of estimators based on 
the method studied in Shimotsu (2007) by substituting the periodogram applied there for an 
arbitrary spectral density estimator. We analyze two frameworks. First we assume that the 
spectral density estimator is consistent for the spectral density estimator and we show that the 
proposed semiparametric estimator is also consistent under mild conditions. Second, we relax 
the consistency condition and derive necessary conditions for the consistency and asymptotic 
normality of the proposed estimator. We show that the variance-covariance matrix of the lim- 
iting distribution is the same as the one derived in Shimotsu (2007), under the same conditions 
imposed in the process. 

In order to assess the finite sample performance and illustrate the usefulness of the estimator, 
we perform a Monte Carlo simulation based on VARFIMA(0, d, 0) processes. We applied the 
so-called smoothed periodogram (which is shown to satisfy the consistency conditions imposed 
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in the theoretical results) with the Bartlett's weight function as the spectral density estimator. 
For comparison we also compute the estimator proposed in Shimotsu (2007). Both estimators 
perform great but the one proposed here present generally better results. 

The assumptions required in the asymptotic theory are mild ones and are commonly applied 
in the literature. The semiparametric methodology present several advantages compared to the 
parametric framework such as weaker distributional assumptions, robustness with respect to 
misspecification of the short run dynamics of the process and efficiency. The theory includes the 
fractionally integrated processes as well as the class of VARFIMA processes. 
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Appendix A: Proofs 

In this section we present the proofs of the results in Sections 4 and 5. We establish lemmas and theorems 
in the same sequence as they appear in the text. 



Proof of Lemma I4.lt 

By hypothesis, / n (A) = /(A) + op(n _/3 ). Recalling the definition of Aj given in (2.1 1, we have 

1 m 

G ( d °) = - E Re[A i (d )- 1 /n(A i )A j (do)- 1 '] 

1 m 

= - E MHdo)- 1 (/(A,) + O p(n^))A,(d )-i '] 

q m 

= G °+E([-E A f° fc H n_/3 )) = G o+ OP (i), 



since 



Lra 

k=l j = l 



Ea 



op(n-") 



1 /27rra\ 
2d° k + 1 \~) 



2d" 



2dl + 1 



m 



E 

3=1 



2.1" 



2,1" 



where the second equality follows from lemma 2 in Robinson (1995b), by taking 7 = 2d" + 1 > 0. The 
last equality is justified as follows: recalling that /? € (0, 1), if > 0, (m/n) 2d ° — > 0, and the result is 
immediate; if d° < 0, m 2d ^ — > and the result follows from the assumption that d £ ftp, which implies 
2d° k + f3> 0. This completes the proof. ■ 



Proof of Theorem I4.lt 

Let 6 = (0 1; • • • ,e q )' :=d-d and L(d) := 5(d) - S(d ). Let < 5 < 1/2 be fixed and let 

N s := {d: Hd-dolU >*}• 
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Let < e < 1/4 and define 6i := JO : G [-1/2 + e, 1/2] 9 } and 6 2 = n^\0i (possibly an 
empty set), where Vtp is given by (4.2). Following Robinson (1995b) and Shimotsu (2007), we 
have 



Pflld-dolU >S) <P(_raf {L(d)}<0) 

<p(jnf (L(d)} < o) +Pf inf {L(d)} < o) := Pi + P 2 (A.l) 

where, for a given set O, O denotes the closure of O. We shall first show that Pi — > 0, as n 
tends to infinity. Rewrite L(d) as 

L(d) = log (det{G(d)}) - log (det{G(d )}) -2^0 fe ~ £ log(A J ) 



m 

fc=l 3=1 



27rm^~ 2Efc<?fc 



log ( det{G(d)}) - log ( det{G(d )}) + log 

- 2 E ^ - E m?) - lo s( m ) - E lo g( 20fe + !) 



, m 

k=l v 3=1 ' k=l 



where 



log (*(d)) - log (<B(d)) - log (*(do)) + log (B(do)) + H{d) 
Q 1 (d)-Q 2 {d)+H{d), (A.2) 



Qi(d) := log log (0(d)), Q 2 (d) := log (*(<*,))+ log (0(do)), 

X(d) := ( — J det{G(d)}, <8(d) := dct{G } ]J w ~^, 

^ ' k— 1 
9 



and ^(d) := 2]T fe log(m) - - £ log(i) J - £ log(20 fe + 1). 

fe=l ^ m 3=1 ' k=l 

By lemma 2 in Robinson (1995b), log(m) — m _1 X^jli l°s(i) = 1 + 0(m _1 log(m)), so that 

« r , 

m(d) = E ^ [ 20 fc ~ lo s( 2 ^- + !) 



() ,'log(m) 



fc=i 

Since x — log(x + 1) has a unique global minimum in (—1, oo) at x = and x — log (a; + 1) > x 2 /4, 
for I a; I < 1, it follows that 



inf {^(d)} > 1 (2m&x{6 k }) 2 > S 2 > 

■ne, 4 V k ) 



0. 



As for Qi(d) and Q%(d) in (A.2), it suffices to show the existence of a function h(d) > satisfying 
(i) sup{|^(d) - h(d)\} = op(l); (ii) /i(d) > (8(d); (iii) /i(d ) = S(d ), 



Si 



as n goes to infinity, because (ii) implies inf|/i(d)| > inf|2?(d)| > 0, so that, uniformly in 0i, 

0i 0i 

Q x {d) > log (*(<*)) - log (/i(d)) - log (ft(d) + op(l)) - log (h(d)) = o P (l), 

and (iii) implies Q^id) = log (h(do) + op(l)) — log (/i(do)) = op(l), from which Pi — >0 follows. 
To show (i), recall that 



L J {d)- 1 = diag {Xf gi(Ai-T)*/a} = diag {A^-^e^-^C*-^/ 2 x A? e 1 ^-*^/ 2 } 
fce{i,-,?} fee{i,-,9} 

= A,(d - do)- 1 A, (do)- 1 = AjpJ-^do)- 1 , 
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so that we can write 



rn 

G(d) = -J2 Re [ A jW~ 1 G(d )7^{e)^ / ] 



(A.3) 



3=1 



and 



-^Re[A J (6»)- 1 G(d )A J (6>)- 1 '] I 

m .7 = 1 J 



det l — ^Re[A^(6>)G(rf o )Af J (0) '] L 



(A.4) 



where Mj(0) : = diag fcg j 1; ... q y |e^ Aj 7r ^ fc / 2 (j/m) 6,fc |. To determine the function h in (i), we first 
show that 

— ^2He[Mj(d)G(do)Mj{6) '] = — E Re [Mj (0)GoMj(d) '] + o P (l), 



rn 



3 = 1 



3 = 1 



uniformly in 0j. By Lemma 4.1, it follows that 

-i TO 

- E Re [M, (0) (G(do) - G )M~W) ' 



3 = 1 



-^Re[M,(0) Op (l)Af J (0) ']. 

:Jl 3 = 1 



(A.5) 



(A.6) 



The (r, s)-th element in (A.6) is given by 

B T +B S 



1 



E Rc 



3 = 1 



,i(A J - 7 r)(0 r -e s )/2 f J 



op(l) 



E 



m * — ' \ to 

3=1 



?r + 9 S 



o P (l) = Op(l), 



where the last equ ality follows from lemma 2 in Robinson (1995b) by taking 7 = 9 r + 8 S + 1 £ 
[2e, 1]. Therefore, (A.5) follows uniformly in Q\. The next step is to derive an approximation 
to the RHS of QA.5D . First, since &(*-^Wr-0,)/2 _ e i7r(e r -e s )/2 + the ^ s ). th e i ement Q f 

the RHS of (A.5), omitting the Re[-] operator, can be written as 



j=i 



1 i(A 3 -7r)(e r -e s )/2 f 2 



Grs 
— 



-i7r(e r -e,)/2 



E 



TO 1 ' \ TO 

3 = 1 



6r+8 s 



GT 
I) 



' V ( J ) (7: 

TO ^— ' \ TO 



Now, from lemma 2 in Robinson (1995b), it follows that 



E 



TO ' — ' \ TO 



1 /27TTO 

1 + e r + e s l^T 



w r 1 ^ s 

m 



1 



1 



/ r I \J Q 



2 nm 



E 

j=i 



(i + 0(to~ 2£ )). 



(A.7) 



Substituting (A. 8) on (A.7), it follows that 

8 r +9 s 



1 



E 



3 i(A 3 -7r)(0 r -9 s )/2 / 1 



J=l 

Define the matrices 



op(1) = 



-itr{9 r -8 s )/2 



G o+0{^)+O(m- 2 -). 



(A. 



(A.9) 



~i7r(0 r -e s )/2\9 



and A4(0) 



1 + # r + #s / r s —i 



and notice that, from (A.9), we have 



— M j (e)G M j {e) ' = A4(6>) G + O (™) + 0{m~ 2e ). 



(A.10) 



j'=i 
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Upon defining, 



h{d) :=det{Re[£(0)] ©A4(0)©G O }, 



the continuity of the determinant, the finiteness of £{6),Ai{6) and Go, for 6 G @i, and (A. 10), 
imply (i). Inequality (ii) follows upon observing that V{6) := Re[£(0)] Ai(6) is positive 
semidefinite (cf. Shimotsu, 2007, p. 293) and, from Oppenheim's inequality, 

q q 
h(d) > J] V kk {8) det(Go) = \{ M kk (0) det(G ) = B(d). 

k=l k=l 

By definition, £ rs (0) = 1, for all r, s = 1, • • • , q, so that 

h(d ) = det {M(0) G } = ®(d ), 

and (iii) follows. 



Now we move to bound P2 in (A.l). Expression (A. 3) can be used to rewrite L(d) as 

q rn 

L(d) = log(det{G(d)})-log(det{G(d )})-2^^-^log(A,) 
= log (det {V{d)}) -log (det {V(d )} 



rn 

k=i 3=1 



(A.11) 



where 



with 



3(d) ^J2 Re [' p M~ 1 G(d )v^e)= 1 '], 



Vj(0):= diag /e 1 ^-^ 9 "/ 2 (^ j and ? := exp (— ]|Sog(j) 



and, as m tends to infinity, p ~ m/e. Observe that T>(d) is positive semidefinite since each 
summand of T> is. For k £ (0, 1), define 

P K (d):=- £ Re[^(0)- 1 G(d o )^(0)-i '] and Q re (d) := — £ Re[^-(0)G o ^(0) '] , 



j=[Tnre] 



where [x] denotes the integer part of x. By Lemma 4.1 



m 1 m 



j' = ["l«] 



The (r, s)-th element of T> K (d) — Q K (d) is then 



(v K (d) - Q K (d)) =Rc - V e 1 



,i(A 3 -7r)(e r -e 5 )/2 / J 



op(1) 



1 E 

m * — ' 



j=[mn] 



1 r +B s 



op(1) 



0(1)op(1) = op(1), 



uniformly in G G2, where the penultimate equality follows from lemma 5.4 in Shimotsu and 
Philips (2005). Therefore, as n — > 00, for any k 6 (0, 1), 

sup {| det{V K (d)} - det{Q re (d)}|} = o P (l). 
©2 
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Now, for all d G 02, one can obtain A S (0, 0.1) so that, for k sufficiently small and m sufficiently 
large (cf. Shimotsu, 2007, p.295), 



inf{det{P K (d)}} =inf{det{Q K (d)}} +o P (l) >det{G }(l + A) + o P (l). 

©2 O2 



Now, since det {V(d )} = det {G(d )} det{G }, as n — > 00, by Lemma 



4.1 



it follows that 



P(^inf {det{V K (d)}} -det{r>(d )} < 0) — > 0, asn^oo 



Also, since det |2?(<i)} > det we conclude that 



Pfinf {det {25(d)}} -det {©(do)} < OJ — ► 0, as n -> 00. (A.12) 



Since the logarithm is a monotone increasing function of its argument and in view of (A.ll), 
(A.12) implies P2 > and the proof is complete. ■ 



Proof of Lemma 14. 2\ 

For fixed r, s 6 {1, • • • , q}, let stf uv := ^j= u ^ an d &uv '■= Yfj=u where 

*S :=e^-)( d ?- d »)/ 2 Af +d °[/r(A,) - (A(A i )) p ./.(A,)(A^)') J, 



(A.13) 



and 



:= eKAi^W-d!)^ +<(A(A,)) r J e (A i )(A(A i ) ').. - ^ 



(A.14) 

Hence, for each j, ^ + & d = e K^-*)(<%-d°)/2 X 4+d° frs( X .} _ qts_ For fixed u <j< VjWe have 

E(K-|) = (2nj) d ° +d '^(n- d °- d ° /™(Aj) - (A(Xj)) r I e (Xj)(A{Xj)')_ 

1 



(27rj) c 



= 



o(r 7 ). 



Therefore, max{E(| ^J =u -*Sj|)} = 7 ) and the result on £if uv follows. As for S8j, from the 

proof of lemma 1(a) in Shimotsu (2007) (notice that BBj does not depend on f n ) it follows that 
Y^ V j= u ^i = °~p( v ) uniformly in u and v and hence the desired result on £$ uv follows. ■ 



Proof of Theorem 14. 2\ 



From a careful inspection of the proof of Theorem 4.1, we observe that it suffices to show (with 
the same notation as in that proof) 



(A.15) 



3=1 



3=1 



uniformly in ©1 and that T> K (d) — Q K ( d) = o p(l), uniformly in 02- To show (A.15), notice that 
the (r, s)-th component of the LHS in (A.15) is given by 



3=1 



3 i(A,-7r)(e r -e s )/2 / 1 



/-(A J .)(Af ) (d )A(. s) (rf ) 



Summation by parts (see Zygmund, 2002, p. 3) yields 



sup • 

0i 



3=1 



,i(A 3 -7r)(e r .-0 3 )/2 f J_ 

m 



/-(A J -)(Af(d )Af (do) J-G% 



< 
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1 



m— 1 



< — } sup 

k=l Ul 
k 

X 



r + 9 a 



J(\ k + 1 -ir){0 r -8 s )/2 



fc + 1 



£ [/-(A^Af (do)A< s) (d ) ') - G^] + 1 £ [/; s (A^)(Af ) (d )Af (do) ') - G 



3=1 
m— 1 



3=1 



< 



^E ~ ^> E[/« S ^)( A f (^)Af(do) )"-05 

k=l V / j=l 

^E[/« S (^)( A 5 r) (rfo)A5 s) (do) ) -G5 



(A.16) 



where < C < oo is a constant. Now, from Lemma 4.2 

/,•- 



" 1 



E 

fc=i 



E[/"(^)( A r( d o) A S s) ( d o) ) -eg 

3=1 

m— 1 / , v 2e ^ 

^(Kl + I^-I) 



1 m— 1 rn—1 



fe=i 



k=l 



, . m(m — l) 2e 1 . , 
op(1) + 5^ op(1) - op(1), 



uniformly in (r, s), where the penultimate equality follows from Lemma 4.2 which implies 



E 



1 rn— 1 \ 1 m— 1 

^E^KI =i^ fc2(e " )E (K fe l) 



fc=i 



fe=i 



< (m ^ 0(( TO - 1)1-7) = o(l) ; 



uniformly in (r, s). The other term in (A.16) is also op(l), uniformly in (r, s), by the same 
argument and, hence, (A. 15) follows. On the other hand, the (r, s)-th element of T> K (d) — Q K (d) 
is given by 



1 

m 



E ^ 

j = [mK] 

m 
V 

op(1) 



3 i(A 3 -7r)(0 r -0 a )/2 [ I 



o r +e t 



/; s (A J )(Af ) (do)Af ) (do) / )"-GS 



Re 



E • 



a i(A 3 -7r)(e r -9,)/2 I I 



III 



o r +e. 



^*(A J -)(A} r) (do)AW(d ) 



\-i 



n 



uniformly in £ 62, where the last equality is derived similarly to (A.16) from summation by 
parts and lemma 5.4 in Shimotsu and Phillips (2005). This completes the proof. ■ 



Proof of Corollary |4.1 



From the proof of Lemma 4.2 , in order to show ( |4.3[ ) it suffices to show that, for sij as in ( A. 13 ) 
and f n as in the enunciate, E(|^-|) = 0(j~ 7 ), for some 7 > 0. From the proof of lemma 1(a) in 
Shimotsu (2007) (see also theorem 2 in Robinson, 1995a), we have 



E(l n (X j ))=f(X j )(l + 



log(j + 1) 



3 



E(/ e (A,)) 



log(j + 1) 
3 



(A.17a) 
(A.17b) 
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E«(A;K(A;) 
Assumption A3 implies 



2tt 



■\d° 



for j = !,-■■ ,m. 



(A.17c) 



E(|^|)=E 



< 



\<t(n) 

E ^ s (fc)E(| e 1 ^— )(«-««)/ 2 Af +d ° [C(A J+fc ) - (A(A i )) P ./.(A i )GTO ').J |). (A.18) 



|fe|<£(n) 



Rewrite the expression inside the squared brackets on the RHS of (A.18) as 



K(A+fc) - (A(Ai)) r w e (A J -))J<(A i+fc ) + (A(Aj)) r w,(Aj)[ttfH(A i+fc ) - (A(A J )). S ^(A J ) 
Now, observe that 

<(A, +fe )^f(A7) ' = ^-(E ) ( E £ t (s) e"^ ) = ^ E E 



t=i 

n n / oo 



t=l 



a=l 6=1 



27rn 



a=l 6=1 x t=0 



t=0 



so that, by using (A. 17b), we conclude that 



E 



(w r n (\ j+k )wi(\j) ') = ^f r - 0(1) + O 



log(j + 1) 



(A.19) 



From jAJTal ), jATfb) , ( [ATfc) , (A(A i )) r .(A(A i ) ') r ./27r = f rr (Xj), /"'(A;) 2 ^ ~ Gg' r , assump- 
tion A5 and because the function log(|x| + l)/|x| is increasing in (— oo,0) and decreasing in 
(0, oo), it follows that 

E(| «(A i+fe ) - (A(A,)) r . U ; e (A i )) f) = E(l?(\ j+h ) - <(A j+fc R(A~) '(A(A^ ') r .+ 

+ (A(A^)) r I e (A j )(A(A7) '). s - (A(A7) ) r w e (X j )w r n (X j+k ) 
log (min{| j + k\,j} + 1) 



min{|j + fc|,j}A^f fc 



0(fl>{j,k,4)), 



where ^(j,k,d° r ) := log(min{li+fc| ' i 2 } + 1) , and similarly for E(|<(Ay) '- (A(A 3 -)) ^(Ay) '| 2 )- Also ; 

J- 



E(/* s (Aj+fc)) = 0(A J+fc s ). By the Cauchy-Schwartz inequality, 

E(|«(AA- (A(A J -)) r .^(AA)| 2 ) l E(|<(A i+fe )| 2 )' + 
+ E(| (A(AA) r .™ e (A^| ' - (A(X 3 ))_w e (X 3 ) ' 



E(K|)< E WnWlX* 
\k\<£(n) 



E w; i (fc)|A J | d?+d -0(^C/,A 1 d?))0(|A J+fc |- t «) 
|*|<«(n) 



= E cwo^ 

|k|<*(n) 



log (min{|j + fc|, j} + 1) 
min{|j + fc|, j} 1 / 2 



1/2 > 



d"+d{ 



0(j~ 70 ), 



for < 70 < 1/2, where the last equality follows from assumption A5. This concludes the proof.l 
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Proof of Lemma I5.lt 



(a) For r, s E {1, • • • , q} fixed, ignoring the maximum in expression ( 5.2 ) for a while, its argument 
can be written as 



^e^-)^^^)/ 2 Af +d °[/-(A,)-(A(A J 0) r Je(A J Ora / ). s 



.i=i 



< 



< o 



27Tj 

n 



V d "+ d s \ 



/; s (A,)-(^(A J -)) r ./ e (A J )(^(A J -) 

u i/a 



Op 



log(u) / ' 



uniformly in 1 < u < m and the result follows. 



(b) Rewrite the argument of the summation in (5.3) as srfj + + where 



^c^-^-^Af^ 

# ;= e i(A,- 7 r)«-d°)/2 A ^+^ 



/-(A,)-(^(A J )) r ./ £ (A J )(v4(A J ) ). s 
(A(A J -)) r / e (A J )(I(A7) / ). s -/r S (A J ) 



Part (a) yields max r)S j^jLi J = °P ^w 1 / 2 ( log(w)) ^ , while, from the proof of lemma l(b2) 

in Shimotsu (2007), we obtain max r)S |X/j=i = Op (v 1 / 2 log(v)) . Assumption CI implies 

max^s |Z^J=i l^j'l} = 0(-u a+1 n _a ) . The result now follows by noticing that v 1 / 2 (log(v)) 1 = 
O^log^)). ■ 



Proof of Theorem I5.lt 



The idea of the proof is similar to that of Lobato (1999) with similar adaptations as in Shimotsu 
(2007). By hypothesis, 



i(d-do), 



dS(d) 




dS(d) 


+ 


fd 2 S{d) 




dd 


d 


dd 


do 


V dddd' 





with probability tending to 1, as n tends to infinity, for some d such that ||d — do||oo < <io||oo- 
We observe that d has the stated limiting distribution if 



95(d) 



and 



dd 



d 2 S(d) 
dddd' 



-> N(0,fl) 



-> n. 



(A.20) 



(A.21) 



We shall prove ( |A.20[ ) first. In order to do that, we use a Cramer- Wold device. Let rj be an 

dG(d)~ 



arbitrary vector in H 9 . Observe that, for r E {!,■■■ ,q}, 



m - 



dS(d) 

dd r 



5>g(A,- 



tr 



dd r 



Let I( r ) denote a q x q matrix whose (r, r)-th element is 1 and all other elements are zero. Define 
a function <p : (0, oo) — > C by 

<Kz):=log(z)+ifc^Y (A.22) 
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Since A^d)" 1 = diag {A^e i(Aj_7r)a!fe/2 } and Re [(a + ib)(c + id)] = ac - bd, we can write 
ke{i,-, q } 



dG(d) 



dd r 



m 

= ^Re[^(A J )A,( ( io)- 1 I (j . ) /„(A J )A 3 ( ( io)- 1 ' 

' ' =1 

1 m 

^ ^ Re[R^)A,(d )- 1 / n (A J )I (r) A J (d )- 1 ' 



L ^log(A J )Re[A J (d )- 1 (l (r) / Il (A J ) + /„(A J )I w )A i (d )-i '' 



3=1 
1 

m 



E 

3=1 



Aj -7T 



111! 



A^do)"^ - I(r)/n(A 3 -) + /„(A J )I (r) )A,(d )- 1 



:=^?(r) + J*|(r). 



(A.23) 



Therefore, from (A.23), we obtain 

9 



dS(d) 



Od 



E 



VkVm 



dS{d) 



d o k=l 

2 

m 



dd k 



E^ 

fe=i 

--^1+^2- 



Y lo g(A,) + *r [GCdo)" 1 ^^)] + Y, Vk tr [G(d )- 1 ^ 2 (fc)] , 

3=1 



k=l 



We analyze £%\ first. By letting 

0j := logCA,-) - - Vlog(A fc ) = log(j) - - Vlog(fc) = 0(log(m)), 



k=l 



k=l 



we can write 
2 



'"■ r / 9 \ 

Y^g(Xj)+ti [G(d y 1 Jf 1 (k)] = tr G(do)- 1 (j%(k) - — ^ log(A,)G(d )I (fc) 

3=1 L ^ 3=1 ' 



tr 



G(d )- 1 ^^ aj Re[A J (d )- 1 /n(A J )A,(do)- 1 ' 



(A.24) 



o m 

(Gq 1 ) fc . + op(l)] ^= 2 Oj (Re[A,(d )- 1 / n (A J )A,(d )- 1 ' 



By Lemma |5.1| (b), (A.24) can be written as 

2 

n 

3=1 

Now, by Lemma |5.1[ a), 

m 

53a i A j (d Q )- 1 (/„(A J ) - A(A 3 -)/ e (A j )^(A~) ') A^do)- 1 ' 



(A.25) 



3 = 1 



< 



< 0(]og(m)) max J I ^^-J^-^Af +d ° (/"(A.) - (A(A 3 )) r ./ £ (A,)(A(A 3 ) ').. 



0(log(m))op 



op(vrn) 



log(m) 

uniformly in r, s 6 {1, • • • , q}. Therefore, 



— 53 a 3 A 3( d 0)" 1 /n(Aj)A i (d )- 1 ' 



3 = 1 
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= -^2oj A i(*)" 1 ^)- r «(^Wi) A^do)" 1 ' + -^op(^) 

= -t=£ Oj k-(d )- 1 A(A J )I e (A J )I(A^ 'a^cZo)" 1 ' - G ] +o P (l), (A.26) 
v m J=1 

where the last equality follows from Y^JLi a j = 0- For the moment, ignore the op(l) term in 
(A. 25), which will be dealt with later. By doing so and applying (A.26), it follows that 

q m 

]T V k £ a, [(Gq 1 ),. (Re[A J (d )- 1 A(A,)/ £ (A,)A(A^ '] ) - l] + o P (l). 



fc=l 3=1 



Writing 



/e(Ai) = 2^ E £ ' £ * + 9^ E £ «< ei( " _,,)Ai : = J iW + Ur,n), 



t=l 



2irn 



(A.27) 



U,17 — 1 



we can rewrite as 

9 



:£ ?7fe £ aj [(G - 1 ) fc .(Re[A,(do)-M(A J )JiW^(A J ) ^do)" 1 ' 

fe=l ] = l 

.,1m 

+ 7^E%E^[( G o 1 ) fe .(Re[A J (do)- 1 A(A i )J 2 (i;n)A(A J ) A^do)- 1 ' 



fe=i j=i 



) 1 +o P (l). (A.28) 



The first part of the RHS in (A.28) is op(l), which follows (cf. Shimotsu, 2007, p. 297) from 



applying the proof of lemma 2 in appendix D of Lobato (1999), so that we can write 

n t-1 

f. ..^.4.^(11 with A •= £„,!{<• <>,0 i,A • !., 



u =2 u = l 



7m vm 



j'=l 



where 



fi, :=X>(A(A,) A.K)- 1 YjGo 1 ), A,(d )-M(A J ). 



fe=i 



By writing 



■^1 can be further simplified, by using the argument in the proof of theorem 2, p. 297, in Shimotsu 
(2007), as 



n t-1 



u=2 v=l 

As for proceeding analogously to we obtain 



tr 



G(do)- 1 JT 2 (A;)] = [(G^ 1 ),. + 0P (1)]^£(A 3 -7r)(lm[A,(do)" 1 /n(A J )A,(d )- 1 ' 

v m j=1 

On one hand, 

- m 

£(A, - 7r)A J (d )- 1 (/„(A,) - AiXMXj)^) ')A,(d )-i' 

£ (A . _ ^^(^--Sj/^-hC ( /r(Aj) _ {A(Xj))r MX^AiXj) ') 

m 



■k 



(A.29) 



(A.30) 



5=1 



5=1 



< 7T 



5=1 
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0(1) o P 



/ y/rn 



V lo g( m / 



Op 



log(m) 



(A.31) 



On the other hand, by using (A.27), 



^A J A,(d )- 1 A(A,)/ £ (A J )A(A J ) 'A,(d )-i ' < 

n m 

^ ^E A ^ d o)-M(A J )/ £ (A,)A(A J ) A^do)- 1 ' 

3=1 

E A, (do)- 1 A(A J ) - E ete'jA^A^do)- 1 '+ 

m / 1 n \ 

™E A ^ d o) -1 ^) - E ^-^ to'A^do)- 1 ' 



m 



i=i 



U,l'=l 



^1+^2 



n 



-1/21 



From theorem 1 in Heyde and Senata (1972) and assumption C2, - X^"=i £ t £ t = Ig + Op( 
so that assumption C5 implies, for n sufficiently large, = op(y/m). As for =5^2, by the proof 
of lemma l(b2) of Shimotsu (2007), 



,y 2 = — Op (m 1/2 log m) = op (y/m) , 
n 

where the last equality follows by assumption C4. Therefore, 

1 m 

^Ew^r 1 ^)^-)^ 'Aj(do)- 1 ' = op(i). 



(A.32) 



Combining (A.31) and (A.32), we obtain 



E(A,- -7r)Im[A j (d Q )- 1 /n(A J )A,(d )- 1 ' 



m 

^= E Im[A J -(d )- 1 A(A i )/ e (A i )l(A7) 'a^ 



■Qp(1). 



3=1 



Again, by ignoring the op(l) term in (A.30) and proceeding as in p. 298 of Shimotsu (2007) Sti 
can be approximated as 



n u— 1 



2 = E < E 6-« e » + op ( 1 )' with & := E Re t% - ty] sin (%)- ( A - 33 ) 

u=2 v=l * 7=1 



Let 



2i := and Z t := e' t E (&-« + |t-»)e U) for i = 2, 3, • 



Combining (A.29) and (A.33), we conclude that 



E 

k=l 



95(d) 



= E^+op(i). 

do t=l 



Obser ving th at {Zt} ( ^ =1 is a zero mean martingale difference, by a standard martingale difference 
CLT, (|A720|) follows if (cf. Billingsley, 1995, theorem 35.12) 



9 1 



EE(Z t 2 |JVi) - EE 7 ^ 7 ^ °' as n ^ 



(A.34) 



t=i 



fe=i z=i 
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and 



^>(Z?I(\Z t \ > S) -A 0, for all S > 0, as n -> oo. 



(A.35) 



Both results follo w from the arguments in the proof of theorem 2, p. 298, in Shimotsu (2007), 
since (A.34) and (A.35) are (29) and (30), respec tively, in the aforementioned paper. Also 
observe that the op(l) terms in (A. 25) and (A. 30) do not affect the results, since, from our 
arguments, 

m m 

^a 3 Re[A J (do)- 1 /n(A J )A J (d )- 1 '] and -= ]T Im[A J (d )~ 1 /n(A J )A J (d )- 1 ' 



are both Op(l). This completes the proof of (A. 20). 



We now move to show ( A. 21 ). For fixed 5 > 0, let := d — do and define 

M := {d:log(n) 4 ||d-do||oo < <*} = {o : log(n)*||0||oo < 

First we show that P(d G A/f) — >• 1, as n — > oo. Assuming the same notation as in the proof 
of Theorem 4.2, recall the decomposition of L(d) = S(d) — S(do) given in expression (A. 2). By 
applying the same argument as in the proof of Theorem |4.2[ we first obtain 

inf {*(«*)} ><5 2 log(ri) 8 , 



and upon applying Lemma 5.1 we obtain 



sup(|A(d) - h(d)\ \ = O f 



m a log(m) m 



It follows, uniformly in Gi (cf. Shimotsu, 2007, p. 300), that 

log (*(d)) - log (3(d)) > log (fc(d) + o P (log(7i)- 8 )) - log (h(d)) = o P (log(n)- 8 ) 
log (X(do)) - log (!8(do)) = log (fc(do) + o P (log(n)- 8 )) - log (fc(do)) = o P (log(n)- 8 ) 

Hence, P(inf Ql \_ M L(d) < 0) ->■ and P(d G A4) ->■ 1, as n — >• oo, follows. 
Now, observe that 

d 2 s{d) 



= tr 



-Gid)-^G { d)-^ + G { d)-^ G W 



dd. 



dd r dde 



dd r dd s 

The derivatives of G(d) are given by 
fir* i a\ i rn i m 

— J-i = -^Ref^(A J -)I W A J -(d)-Vn(A i )A J -(d)-i '] + - ^Ref^(A7)A J -(d)- 1 /n(A i )A i (d)- 1 'l (r ) 

(A.36) 



and 



f^T =^ERe[^(A J ) 2 I W I( s )A J (d)- 1 /n(A J )A,(d)- 1 ' 

T S 3=1 

1 m 

+ - E Re[|^)| 2 I (r) A i (d)- 1 /n(A i )A j (d)-i 'i 

1 m 

+ - E Re[|^)| 2 I (s) A j (d)- 1 /n(A i )A7(dp 'i 



(r) 



1 m 

- £ Re[^)%(d)- 1 / n (A J -)Va)= T 'i 



(r)I( S ) 



J = l 
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where <p is given by (A.22). For k = 0, 1, 2, let 



= i53log(A J -)*Re[A i (d)- 1 /n(A i )A J -(d)- 1 ' 



I*(d) - ^X]log(A j ) fe Im[A j (d)- 1 /n(A j )A J -(d)- 1 ' 



so that, we can write 



= I (r) fti(d) + fti(d)I (r) + £ (l w 2b(d) - 2b(d)I(r)) + op f ^^7^) , (A.37) 



and 



d 2 G(d) _ ^ 
dd r dd s 4 



log(n) 



I( r )I( a )7^ («0 + I(r)^o(d)I( s ) + I( a )fto(d)I( r ) + 7^ (rf)I(s)I(r) + 

I (r )I( s )Ii(d) +Zi(d)I( r )I (s ) + I (r .)I( s )72. 2 (d) + I( r )^ 2 (d)I( s )- 
+ I (a) ft 2 (d)I (r) + ft 2 (d)I (s) I (r) +op(l). 



(A.38) 



The ord er of t he remainder term in (A.37) is obtained as follows. Rewrite the first term on the 
RHS of (TA361) as 



-^Re^A^Icr) (RefA^^-VnCA^ATpp ' 



i Im 



z * 

j'=i 



log(Aj)I (r .)Re A,(d)- 1 /„(A,)A,(d)-i + Hog^I^m A J (d)- i /„(A,)A J (d)-i 



(r) 



Re 



(r) 



111! 



E 



log^I^Re A^dJ-VnCAiJA^d)- 1 



Aj -7T 



I( r )Im 



A i (d)- 1 /„(A J -)A ; ,(d)- 1 
A^^-VnCA.OATPP' 



By summation by parts, Lemma 5.1 and assumption C4 



, rn m—1 j 

- ]T Aj [A^^-VnCA^ATppr '] < £ | Aj - - A J+1 | £ A^^-^nCA^A^Idp ' 

m 3 = 1 m 3=1 k=l 

+ EA J (d)- 1 /„(A,)A-(dp' 

k=l 00 

Op (m - 1) 1/2 log(m - 1) + i -I + O - 



< 



rn n 



O 



to 1 / 2 log(m) m" 
n n a 



O 









) ^ 771 y 










±) - 

ran / 


° P (log(n 



where the last equality follows from 

m 1 / 2 log(?7!,) 



m 1 / 2 log(m) log(ra) 

log(n) = — T71 VTA > 0, 



n l/2 „l/4 n l/4 



as n tends to infini ty, by assumption C4. The other term in (A.36) is dealt analogously. The 
remainder term in (A.38) involves 



3 = 1 
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for p(Xj) proportional to Xj,Xj and Xj log(Aj). The order of the terms proportional to Xj has 
already been obtained, while the one proportional to A^ is dealt analogously since A| = O(Xj). 
The term proportional to Xj log(Aj) is op(l) since, by summation by parts, Lemma 5.1 and 
assumption C4, 



3=1 



Xj log(Xj) [A J (d)- 1 /„(A 3 )A J (d)-i ' 
1 ™~3 

^ - E | A J lo g( A ^) - A J+i log(^+i)| ^AfeC^-VnCA.OMdp ' 

fe=l 

rn 

^Hdr'fniXjjA'w 1 ' 



3=1 



X m log(A m ) 
to 

= - Op f (m - I) 1 / 2 log( m - 1) + (™~l) a+1 
In order to finish the proof, it suffices to show that, 



o(l) + o P (l) = Op(l). 



and 



ft*(d) =G - Vlog(A,)' £ + 0p (log(n; 

TO V 
Ife(d) =0 P (l0g(7l)' £ - 2 ), 



fc-2\ 



(A.39) 



(A.40) 



uniformly in d 6 .M. Indeed, if (A.39) and (A.40) hold, upon defining for a matrix M, 



Ti (M,r) :=l {r) M + M\ r) , T 2 {M,r,s) := I (r) I (s) M + I (r) MI (s) + I (s) MI (r) + MI (r) I (s) 



and 



T 3 (M,r,s) := -I (r) I (a) M + I (r) JWI w +I (s) MI (r) -MI (r) I (s) , 
it follows that (cf. Shimotsu, 2007, p. 301) 

G(d)- 1 = Go 1 +o P (log(n)- 2 ), = i f^og^ (G ,r) + o P (log(n)- 1 ), 



Since tr 



and ^4 = to E log(A J ) 2 T 2 (G , r, S ) + ^ T 3 (G , r, a) + o P (l). 
G 1 T 1 (G ,r)G' 1 T 1 (G'o, S )] = tr [G ^(Go, r, s)) and 

-. m / -. m \ 2 



we obtain 



d 2 s(d) 

ddrdd* 



= tr 



Gq X T 2 (G , r, s) + \ G^T 3 (G , r, s) 



op(l), 



from where (A. 21) follows. We proceed to show (A.39) and (A.40). For k = 0, 1, 2, let 



r k {0) := -^logCA^^W-^oA^T'. 



Then, (A.39) and (A.40) follow if 



sup 

d£M 



- ^log^A^drVn^ATCdp ' _ ^(0) 



o P (log(n) fe - 2 ) 



(A.41) 
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and 



sup ( f k {9) - GoiVlog^)* 1 = o(log(n) fc - 



(A.42) 



Following Shimotsu (2007), p. 302, notice that, by applying (A. 3), we can rewrite (A. 41) as 



sup 

deM 



3=1 



Define bj(9;k) := log(A^)V^ -,r W r - fl '>/ 2 Aj r+ *', for A; = 0,1,2. Then, by omitting the supre- 
mum, the (r, s)-th element of (A.41) is equal to 



3=1 



i £ bj(0; k) [e^-^-d^^+^frs^ _ G r. 

-i m— 1 J 



b m {6;k) 



£ JV»-«)&-*l)/*\f+*. /"(Aj) - 
i=i 



(A.43) 



where the inequality follows from summation by parts. Now, since 

log(n) fc 



b j (0;k)-b j+1 (d;k) = O 



J 



and b m (d;k) = 0(log(n) fe ), 



uniformly in G for any = 0, 1,2, it follows by Lemma 5.1 and Remark 5.1 that (A.43) 
can be rewritten as 



O 



log(n) fc \ 1 ^ fm a+1 1/2l , A 
m J m \ n a ) 



O log(n) ft -O, 



^ + m 1 / 2 log(m)) = o P ( log(n) fe - 2 ) , 



where the last equality follows from assumption C4 (see also Remark 5.1), because 



log(n) 2 -O n 
m 



+ m 1 / 2 log(m) 



log(n) 2 log(m) log(n) 2 



m 1 / 2 log(m) m 1 / 2 m 1 / 2 
o(1)Op(1) = Op(l). 



Op(l) 



The other term is dealt analogously, so that (A.41) follows. As for (A.42), the result follows 
from the proof of theorem 2, p. 302, in Shimotsu (2007) (notice that it does not depend on f n ). 
This completes the proof. ■ 
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